
The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form 



Application Serial Number: 0 tfff3$^ 9 ?Q RBC^^ 



Date Processed by STIC: //^WZoPI CE^W ^ 00/290 ° 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(aiuspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http:/Av\v\v.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



IER: 



: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 
Wrapped Nucleics The numberAext at the end of each line \vrapped" down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "Vvrapping". 

Wrapped Aminos The amino acid numberAext at the end of each line Vrapped " down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent ""wrapping". 

Incor/ect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



Misaligned Amino Acid 
Numbering 

Non-ASCII 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacingfEetween fhe numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 



Variable Length Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 



Patentln ver. 2.0 "bug" 



A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 



Please also adjust the "(iii) NUMBER OF SEQUENCES:** response to include the skipped sequence(s). 



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
*400> sequence id number 
000 



Use of n's* or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 

(NEW ffilL^)" Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa. and which residue n or Xaa represents. 



Use of <213>Organism Sequence(s)_ 
(NEW RULES) 



are missing this mandatory field or its response. 



Use of <220>Feature 
(NEW RULES) 



Sequence(s) are missing the <220> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISfvf is "Artificial" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol. 63*. No. 104, pp. 29631-32) (Sec. i .823 of new Rules) 



Patentln ver. 2.0 "bug*" 



.Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 
~nre" Te'stming in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 
Instead, please use "File Manager or any other means to copy file to floppy disk. 
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;110> 



<120> 



<130> 
<140> 
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<150> 
<151> 
<160> 
<170> 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 35,770 

Input Set : A:\Yamamtol.app 

Output Set: N:\CRF3\01242001\I435770. 

APPLICANT: YAMAMOTO, Takuo 
MARUTA, Kazuhiko . 
KUBOTA, Michio 
FUKUDA , Shigeharu 
MIYAKE, Toshio 

TITLE OF INVENTION: NON- REDUCING SACCHARIDE- 
TREHALOSE - RELEASING ENZYME, AND PROCESS FOR 
SACCHARIDES USING THE ENZYMES 
FILE REFERENCE: YAMAMOTO=16A 
CURRENT APPLICATION NUMBER: 09/435,770 
CURRENT FILING DATE: 1999-11-08 
PRIOR APPLICATION NUMBER: JP 258,394/1998 
PRIOR FILING DATE: 1998-09-11 
PRIOR APPLICATION NUMBER: JP 352,252/1998 
PRIOR FILING DATE: 1998-12-11 
PRIOR APPLICATION NUMBER: JP 16,931/1999 
PRIOR FILING DATE: 1999-01-26 
NUMBER OF SEQ ID NOS : 3 9 
SOFTWARE: Patentln Ver. 2.1 



DATE: 01/24/2001 
TIME: 14:32:13 



raw 



Does Not Comply 
Corrected Diskette Needed 



FORMING ENZYME, 
PRODUCING 



DECEIVED 

FEB 0 2 2001 

TECH CENTER 1600/2908 



ERRORED SEQUENCES 







241 






242 






243 






244 






246 


E- 


-> 


247 






248 


E- 


-> 


249 






250 


E- 


-> 


251 






252 


E- 


-> 


253 






254 


E- 


-> 


255 






256 


E- 


-> 


257 






258 


E- 


-> 


259 






260 


E- 


-> 


261 






262 


E- 


-> 


263 






264 


E- 


-> 


265 



<210> SEQ ID NO: 7 
<211> LENGTH: 2268 
<212> TYPE: DNA 

<213> ORGANISM: ARTHROBACTER sp . 
<400> SEQUENCE: 7 
cccgccagta cctaccgcct tcagatctcg 
60 , . 



S34 

gcggagttca ccctcttcga 



atcgtgccct acctgcaccg cctcggcgcc 
120 _ 



gactggctgt acctctcgcc 




tccgagtcgg gctcctcgca cggctacgac gtggtcgacc actcccgcgt cgacgccgcc 

180 

cgcggcgggc cggaggggct cgccgagctc tcccgtgcgg cgcacgagcg cggcatgggc 

240 

gtcgtcgtcg acatcgtgcc caaccacgtc ggcgtcgcga cgccgaaggc gaaccgctgg 

300 

tggtgggacg ttctggcccg tggacagcgg tcggagtacg ccgactactt cgacatcgac 

360 

tgggagttcg gcggcggcag gctgcgcctg cccgtgctcg gcgacggccc cgacgagctc 

420 

gacgcgctga gagtggatgg cgacgagctc gtctactacg agcaccgctt cccgatcgcc 

480 

gagggcaccg gcggcggcac cccgcgcgag gtgcacgacc ggcagcacta cgagctgatg 

540 

tcgtggcggc gggccgacca cgacctcaac taccgccgct tcttcgccgt gaacacgctc 



file://C:\CRF3\OutholcKVsrI435770.htm 
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RAW SEQUENCE LISTING DATE: 01/24/2001 

PATENT APPLICATION: US/09/4 3 5 , 77 0 TIME: 14:32:13 



Input Set : A:\Yamamtol.app 

Output Set: N:\CRF3\01242001\I435770.raw 

266 600 

E--> 267 gccgccgtac gcgtcgaaga cccgcgcgtg ttcgacgaca cccaccgcga gatcggccgc 
268 660 

E--> 2 69 tggatcgccg agggcctcgt cgacggcctg cgcgtcgacc accccgacgg gctgcgcgcc 

270 720 

E--> 271 cccggcgact acctgcgccg tctcgccgag ctcgcccaag gcaggccgat ctgggtcgag 

272 780 

E--> 27 3 aagatcatcg agggcgacga gcggatgccc ccgcagtggc ccatcgccgg caccaccggc 

274 840 

E--> 27 5 tacgacgcgc tggccgggat cgaccgggtg ctcgtcgacc ccgcgggcga gcatccgctc 

276 900 

E--> 2 77 acccagatcg tcgacgaggc ggcaggcagc ccccggcgct gggccgagct ggttcccgag 

278 960 

E--> 2 79 cgcaagcggg ccgtcgcccg cggcatcctg aactccgaga tccgccgcgt cgcccgcgaa 

280 1020 

E--> 2 81 ctcggagagg tcgccggcga cgtcgaagac gcgctcgtcg agatcgccgc cgccctgtcc 

282 1080 

E--> 28 3 gtctaccgca gctacctgcc gttcgggcgc gagcacctcg acgaagccgt ggccgccgcg 

284 1140 

E--> 285 caggccgcag ccccccagct cgaggccgac ctcgccgccg tcggcgcagc gctcgccgac 

286 1200 

E--> 287 ccgggcaacc ccgccgcgct ccgcttccag cagaccagcg gcatgatcat ggccaagggc 

288 1260 

E--> 28 9 gtcgaggaca acgcgttcta ccgctacccc cggctcacct cgctgaccga ggtcggggga 

290 1320 

E--> 291 gacccgagcc tgttcgcgat cgacgcggcc gccttccacg cggcgcagcg cgaccgcgcc 

292 1380 

E--> 293 gcccggctgc ccgagtcgat gacgacgctg accacccacg acaccaagcg cagcgaagac 

294 1440 

E--> 2 95 acccgggcgc ggatcaccgc gctcgccgag gcccccgaac gctggcggcg cttcctgacc 

296 1500 

E--> 297 gaggtcggcg ggctcatcgg aacgggcgac cgggtgctgg agaacctgat ctggcaggcg 

298 1560 

E--> 299 atcgtcggcg cgtggccggc gagccgggag cggctcgagg cctacgcgct gaaggccgcg 

300 1620 

E--> 3 01 cgcgaagccg gcgagtcgac cgactggatc gacggcgacc ccgcgttcga agagcggctg 

302 1680 

E--> 3 03 acccgcctgg tcacggtcgc cgtcgaggag ccgctcgtgc acgagctgct cgagcggctc 

304 1740 

E--> 305 gtcgacgagc tgacggcggc cgggtactcc aacggcctcg cggcgaagct gctgcagctg 

306 1800 

E--> 307 ctcgcccccg gaacccccga cgtgtaccag ggcacggaac gctgggaccg gtcgctggtg 

308 1860 

E--> 3 09 gacccggaca accgtcgccc cgtggatttc gccgcggcat ccgagctgct cgaccgcctc 

310 1920 

E--> 311 gacggcggct ggcggccgcc cgtcgacgag accggcgcgg tcaagacgct cgtcgtctcc 

312 1980 

E--> 313 cgcgcgctgc ggctgcgccg cgaccggccc gagctgttca ccgcgtacca cccggtcacg 

314 2040 
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RAW SEQUENCE LISTING DATE: 01/24/2001 

PATENT APPLICATION: US/09/4 35,770 TIME: 14:32:13 

Input Set : A:\Yamamtol.app 

Output Set: N:\CRF3\01242001\I435770.raw RECEIVE! 



E--> 315 gcgcgcggcg cgcaggccga gcacctgatc ggcttcgacc gcggcggcgc gatcgccctg 

316 2100 

E--> 317 gccacccgcc tgccgctcgg cctcgccgcc gcaggcggct ggggcgacac ggtcgtcgac 

318 2160 

E--> 319 gtcggcgagc ggagcctgcg cgacgagctg accggccgcg aggcccgcgg agcggcgcgc 

320 2220 

E--> 321 gtggccgagt tgttcgccga ctaccccgtc gccctgctgg tggagaca 

322 2268 

325 <210> SEQ ID NO : 8 

326 <211> LENGTH: 28 

327 <212> TYPE: DNA 

328 <213> ORGANISM: ARTHROBACTER sp.S34 
330 <400> SEQUENCE : 8 

E--> 3 31 ttttttaata aaatcaggag gaaaaaat 

332 28 

529 <210> SEQ ID NO: 17 

530 <211> LENGTH: 1725 

531 <212> TYPE: DNA 

532 <213> ORGANISM: ARTHROBACTER sp.S34 
534 <400> SEQUENCE: 17 

E--> 5 35 atgaaccgac gattcccggt ctgggcgccc caggccgcgc aggtgacgct cgtcgtgggc 

536 60 

E--> 537 caaggccgcg ccgaactccc gctgacccgc gacgagaacg gatggtgggc tcttcagcag 

538 120 

E--> 53 9 ccgtgggacg gcggccccga cctcgtcgac tacggctacc tcgtcgacgg caagggcccc 

540 180 

E--> 541 ttcgccgacc cgcggtcgct gcggcagccg cgcggcgtgc acgagctcgg ccgcgaattc 

542 240 

E--> 54 3 gaccccgccc gctacgcgtg gggcgacgac ggatggcgcg gccgagacct caccggagcc 

544 300 

E--> 54 5 gtgatctacg aactgcacgt cggcaccttc acccctgagg gaacgctgga cagcgccatc 

546 360 

E--> 54 7 cgtcgcctcg accacctggt gcgcctcggc gtcgacgcgg tcgagctgct gcccgtcaac 

548 420 

E--> 549 gcgttcaacg gcacccacgg ctggggctac gacggggtgc tctggtacgc ggtgcacgag 

550 480 

E--> 551 ccctacggcg gcccggaggc gtaccagcgc ttcgtcgacg cctgccacgc ccgcggcctc 

552 540 

E--> 553 gccgtcgtgc aggacgtcgt ctacaaccac ctgggcccga gcggcaacca cctgcccgac 

554 600 

E--> 555 ttcggcccct acctcgggtc gggcgccgcc aacacctggg gcgacgcgct gaacctcgac 

556 660 

E--> 557 gggccgctct ccgacgaggt gcggcggtac atcatcgaca acgcggtgta ctggctgcgc 

558 720 

E--> 559 gacatgcacg ccgacgggct gcggctcgac gccgtgcacg cgctgcgcga cgcccgcgcg 

560 780 

E--> 561 ctgcacctgc tcgaagagct cgccgcccgc gtcgacgagc tggcgggcga gctcggccgg 

562 840 

E--> 563 ccgctgacgc tcatcgccga gagcgacctg aacgacccga agctgatccg ctcccgcgcg 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 35,770 



DATE: 01/24/2001 
TIME: 14:32:13 



Input Set : A:\Yamamtol.app 

Oi/tput Set: N:\CRF3\01242001\I435770.raw 



564 900 

E--> 565 gcgcacggct acggcctcga cgcccagtgg gacgacgacg tgcaccacgc ggtgcacgcc 

566 960 

E--> 5 67 aacgtgaccg gcgagaccgt cggctactac gccgacttcg gcgggctcgg cgccctcgtc 

568 1020 

E--> 569 aaggtgttcc agcgcggctg gttccacgac ggcacctggt cgagcttccg cgagcggcac 

570 1080 

E--> 571 cacggccggc cgctcgaccc cgacatcccg ttccgccggc tcgtcgcctt cgcgcaggat 

572 1140 

E--> 57 3 cacgaccagg tcggcaaccg agcggtcggc gaccgcatgt cggcgcaggt cggcgagggt 

574 1200 

E--> 57 5 tcgctcgccg ccgcggcggc gctcgtgctg ctcggcccgt tcaccccgat gctgttcatg 

576 1260 

E--> 577 ggcgaggagt ggggcgcgcg caccccgtgg cagttcttca cctcccaccc cgagcccgag 

578 1320 

E--> 579 ctgggggagg cgacggcgcg cgggcgcatc gccgagttcg cccgcatggg ctgggacccg 

580 1380 

E--> 581 gcagtcgtgc ccgacccgca ggacccggcc accttcgccc gctcgcacct ggactggtcc 



E--> 583 gagcccgagc gggaaccgca cgcgggcctg ctcgccttct acaccgacct gatcgcgctg 

584 1500 

E--> 585 cggcgcgagc tgccggtcga tgcgccggcg cgcgaggtgg atgccgacga ggcgcgcggc 

586 1560 

E--> 5 87 gtcttcgcgt tcagccgcgg cccgctgcgg gtcacggtcg cgctgcgccc cggaccggtc 

588 1620 

E--> 5 89 ggggtgcccg agcacggggg cctcgtgctc gcctacggcg aggtgcgcgc cggcgccgcc 

590 1680 

E--> 5 91 ggactgcacc tcgacgggcc gggagccgcg atcgtgcgcc tcgag 

592 1725 

595 <210> SEQ'ID NO: 18 

596 <211> LENGTH: 23 

597 <212> TYPE: DNA 

598 <213> ORGANISM: ARTHROBACTER sp.S34 
600 <400> SEQUENCE: 18 

E--> 601 gcsaaccgst ggtggtggga cgt 

602 23 

605 <210> SEQ ID NO: 19 

606 <211> LENGTH: 3252 

607 <212> TYPE: DNA 

608 <213> ORGANISM: ARTHROBACTER sp.S34 

610 <220> FEATURE: 

611 <221> NAME/KEY: 5 * UTR 

612 <222> LOCATION: (1)..(742) 

614 <220> FEATURE: 

615 <221> NAME/KEY: CDS 

616 <222> LOCATION: ( 74 3 ) . . ( 3 013 ) 

618 <220> FEATURE: 

619 <221> NAME/KEY: 3 * UTR 

620 <222> LOCATION: ( 74 2 ) . . ( 3014 ) 



582 1440 
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RAW SEQUENCE LISTING DATE: 01/24/2 001 

PATENT APPLICATION: US/09/4 35 f 770 TIME: 14:32:13 

Input Set : A:\Yamamtol.app 

Output Set: N:\CRF3\01242001\I4 357 70.raw 

622 <400> SEQUENCE: 19 
E--> 62 3 atgccgacga cgaacttgag cgcgttctcg ggcacccgcg agagcggtcc gcgcacggcg 

624 60 ' 

E--> 626 gcgcccagtg ccacgacgag cacgatcgcg gcgagcgccg cgacgacggc gaccggcagg 

627 120 

E--> 629 cgcccctgat tgctggcgaa ggtgagcacg atgaagacca cctcgaggcc ctcgagcaac 

630 180 

E--> 63 2 acacctttga acgacacggt gaacgcgtac caatcggaga ccccgaaccg gctctcgcgc 

633 240 

E--> 63 5 cgggcgctct cggccgcctc gacctgacgc cggaaggcag cctcctcgtc acggagagcc 

636 300 

E--> 63 8 ctgcgccctg ccgcgcgcag caccgccttg cgcagccagc cgagcccgaa gacgagcagc 

639 360 

E--> 641 aacccgccga cgacgaggcg cagcacggcc agcggcagca gcaggatcgc gggaccgacg 

642 420 

E--> 64 4 agcgcgacgg ccgcggccag caccaccacg gcgacggcgg cacctgtcag cgccgaccgc 

645 480 

E--> 64 7 cagctgcggg tggcgccgac cgcgacgacg atcgtggtcg cctccaccgc ctcgaccacg 

648 540 

E--> 650 caggcgagga acacggcggc gaacagggcg acggcggtca tcggcccagc agacggttga 

651 600 

E--> 653 ccatcacggc acgctagcgc cattgctcac aggaagggcc aagacgcccg caacgcggca 

654 660 

E--> 65 6 cccgtggacg gcgcgtaccg gcgtgtgacc gatcgtgtca accggtggcg cccgccccga 

657 720 

E--> 659 gcacctgcgt agattcggcc tc gtg ccc gcc agt acc tac cgc ctt cag ate 

660 772 

661 Met Pro Ala Ser Thr Tyr Arg Leu Gin He 

662 1 5 10 
E--> 664 teg gcg gag ttc acc etc ttc gac gcg gcg cgc ate gtg ccc tac ctg 

665 820 

666 Ser Ala Glu Phe Thr Leu Phe Asp Ala Ala Arg He Val Pro Tyr Leu 

667 15 20 25 
E--> 669 cac cgc etc ggc gcc gac tgg ctg tac etc teg ccg ctg etc gag tec 

670 868 

671 His Arg Leu Gly Ala Asp Trp Leu Tyr Leu Ser Pro Leu Leu Glu Ser 

672 30 35 40 

E--> 674 gag teg ggc tec teg cac ggc tac gac gtg gtc gac cac tec cgc gtc 

675 916 

676 Glu Ser Gly Ser Ser His Gly Tyr Asp Val Val Asp His Ser Arg Val 

677 45 50 55 

E--> 67 9 gac gcc gcc cgc ggc ggg ccg gag ggg etc gcc gag etc tec cgt gcg 

680 964 

681 Asp Ala Ala Arg Gly Gly Pro Glu Gly Leu Ala Glu Leu Ser Arg Ala 

682 60 65 70 

E--> 684 gcg cac gag cgc ggc atg ggc gtc gtc gtc gac ate gtg ccc aac cac 

685 1012 

686 Ala His Glu Arg Gly Met Gly Val Val Val Asp He Val Pro Asn His 

687 75 80 85 90 
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RAW SEQUENCE LISTING DATE : 01/24/2001 

PATENT APPLICATION: US/09/4 35,7 7 0 TIME: 14:32:13 



Input Set : A:\Yamamtol.app 

Output Set: N:\CRF3\01242001\I435770.raw 

E--> 689 gtc ggc gtc gcg acg ccg aag gcg aac cgc tgg tgg tgg gac gtt ctg 

690 1060 

691 Val Gly Val Ala Thr Pro Lys Ala Asn Arg Tx^p Trp Trp Asp Val Leu 

692 95 100 105 
E--> 694 gcc cgt gga cag egg teg gag tac gec gac tac ttc gac ate gac tgg 

695 1108 

696 Ala Arg Gly Gin Arg Ser Glu Tyr Ala Asp Tyr Phe Asp He Asp Trp 

697 110 115 120 

E--> 699 gag ttc ggc ggc ggc agg ctg cgc ctg ccc gtg etc ggc gac ggc ccc 

700 1156 

701 Glu Phe Gly Gly Gly Arg Leu Arg Leu Pro Val Leu Gly Asp Gly Pro 

702 125 130 135 

E--> 704 gac gag etc gac gcg ctg aga gtg gat ggc gac gag etc gtc tac tac 

705 1204 

706 Asp Glu Leu Asp Ala Leu Arg Val Asp Gly Asp Glu Leu Val Tyr Tyr 

707 140 145 150 

E--> 70 9 gag cac cgc ttc ccg ate gcc gag ggc ace ggc ggc ggc acc ccg cgc 

710 1252 

711 Glu His Arg Phe Pro He Ala Glu Gly Thr Gly Gly Gly Thr Pro Arg 

712 155 160 165 170 
E--> 714 gag gtg cac gac egg cag cac tac gag ctg atg teg tgg egg egg gcc 

715 1300 

716 Glu Val His Asp Arg Gin His Tyr Glu Leu Met Ser Trp Arg Arg Ala 

717 175 180 185 
E--> 719 gac cac gac etc aac tac cgc cgc ttc ttc gcc gtg aac acg etc gcc 

720 1348 

721 Asp His Asp Leu Asn Tyr Arg Arg Phe Phe Ala Val Asn Thr Leu Ala 

722 190 195 200 

E--> 724 gcc gta cgc gtc gaa gac ccg cgc gtg ttc gac gac acc cac cgc gag 

725 1396 

726 Ala Val Arg Val Glu Asp Pro Arg Val Phe Asp Asp Thr His Arg Glu 

727 205 210 215 

E--> 729 ate ggc cgc tgg ate gcc gag ggc etc gtc gac ggc ctg cgc gtc gac 

730 1444 

731 He Gly Arg Trp He Ala Glu Gly Leu Val Asp Gly Leu Arg Val Asp 

732 220 225 230 

E--> 734 cac ccc gac ggg ctg cgc gcc ccc ggc gac tac ctg cgc cgt etc gcc 

735 1492 

736 His Pro Asp Gly Leu Arg Ala Pro Gly Asp Tyr Leu Arg Arg Leu Ala 

737 235 240 245 250 
E--> 73 9 gag etc gcc caa ggc agg ccg ate tgg gtc gag aag ate ate gag ggc 

740 1540 

741 Glu Leu Ala Gin Gly Arg Pro He Trp Val Glu Lys He He Glu Gly 

742 255 . 260 265 
E--> 74 4 gac gag egg atg ccc ccg cag tgg ccc ate gcc ggc acc acc ggc tac 

745 1588 

746 Asp Glu Arg Met Pro Pro Gin Trp Pro He Ala Gly Thr Thr Gly Tyr 

747 270 275 280 

E--> 74 9 gac gcg ctg gcc ggg ate gac egg gtg etc gtc gac ccc gcg ggc gag 
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E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



750 
751 
752 
754 
755 
756 
757 
759 
760 
761 
762 
764 
765 
766 
767 
769 
770 
771 
772 
774 
775 
776 
777 
779 
780 
781 
782 
784 
785 
786 
787 
789 
790 
791 
792 
794 
795 
796 
797 
799 
800 
801 
802 
804 
805 
806 
807 
809 
810 



1636 
Asp Ala 

cat ccg 

1684 
His Pro 
300 
tgg gcc 
17 32 
Trp Ala 
315 

ctg aac 

1780 
Leu Asn 

ggc gac 

1828 
Gly Asp 

tac cgc 

1876 
Tyr Arg 

gcc gcc 

1924 
Ala Ala 
380 
gtc ggc 

1972 
Val Gly 
395 

cag cag 

2020 
Gin Gin 

ttc tac 

2068 
Phe Tyr 

ccg age 

2116 
Pro Ser 

gac cgc 

2164 
Asp Arg 
460 
gac acc 
2212 



Leu Ala 
285 

etc acc 

Leu Thr 

gag ctg 

Glu Leu 

tec gag 

Ser Glu 

gtc gaa 

Val Glu 
350 
age tac 

Ser Tyr 
365 

gcg cag 

Ala Gin 

gca gcg 

Ala Ala 

acc age 

Thr Ser 

cgc tac 

Arg Tyr 
430 
ctg ttc 

Leu Phe 
445 

gcc gcc 
Ala Ala 
aag cgc 



Gly lie Asp 

cag ate gtc 

Gin lie Val 
305 

gtt ccc gag 

Val Pro Glu 
320 

ate cgc cgc 

lie Arg Arg 
335 

gac gcg etc 

Asp Ala Leu 

ctg ccg ttc 

Leu Pro Phe 

gcc gca gcc 

Ala Ala Ala 
385 

etc gcc gac 

Leu Ala Asp 
400 

ggc atg ate 

Gly Met lie 
415 

ccc egg etc 

Pro Arg Leu 

gcg ate gac 

Ala lie Asp 

egg ctg ccc 

Arg Leu Pro 
465 

age gaa gac 



Arg Val Leu 
290 

gac gag gcg 

Asp Glu Ala 

cgc aag egg 

Arg Lys Arg 

gtc gcc cgc 

Val Ala Arg 
340 

gtc gag ate 

Val Glu lie 
355 

ggg cgc gag 

Gly Arg Glu 
370 

ccc cag etc 

Pro Gin Leu 

ccg ggc aac 

Pro Gly Asn 

atg gcc aag 

Met Ala Lys 
420 

acc teg ctg 

Thr Ser Leu 
435 

gcg gcc gcc 

Ala Ala Ala 
450 

gag teg atg 

Glu Ser Met 
acc egg gcg 



Val Asp 

gca ggc 

Ala Gly 
310 
gcc gtc 

Ala Val 
325 

gaa etc 

Glu Leu 

gcc gcc 

Ala Ala 

cac etc 

His Leu 

gag gcc 

Glu Ala 
390 
ccc gcc 

Pro Ala 
405 

ggc gtc 

Gly Val 

acc gag 

Thr Glu 

ttc cac 

Phe His 

acg acg 

Thr Thr 
470 
egg ate 



Pro Ala 
295 

age ccc 

Ser Pro 

gcc cgc 

Ala Arg 

gga gag 

Gly Glu 

gcc ctg 

Ala Leu 
360 
gac gaa 

Asp Glu 
375 

gac etc 

Asp Leu 

gcg etc 

Ala Leu 

gag gac 

Glu Asp 

gtc ggg 

Val Gly 
440 
gcg gcg 

Ala Ala 
455 

ctg acc 
Leu Thr 
acc gcg 



Gly Glu 

egg cgc 

Arg Arg 

ggc ate 

Gly lie 
330 
gtc gcc 

Val Ala 
345 

tec gtc 

Ser Val 

gcc gtg 

Ala Val 

gcc gcc 

Ala Ala 

cgc ttc 

Arg Phe 
410 
aac gcg 

Asn Ala 
425 

gga gac 
Gly Asp 
cag cgc 
Gin Arg 
acc cac 
Thr His 
etc gcc 
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811 


Asp Thr 


Lys 


Arg 


Ser 


Glu 


Asp 


Thr 


Arg 


Ala 


Arg 


He 


Thr 


Ala 


Leu 


Ala 






812 


475 








480 










485 










490 


E- 


-> 


814 

815 


gag gcc 

2260 


ccc 


gaa 


cgc 


tgg 


egg 


cgc 


ttc 


ctg 


acc 


gag 


gtc 


ggc 


ggg 


etc 






816 


Glu Ala 


Pro 


Glu 


Arg 


Trp 


Arg 


Arg 


Phe 


Leu 


Thr 


Glu 


Val 


Gly 


Gly 


Leu 






817 








495 










500 










505 




E- 


-> 


819 

820 


ate gga 

2308 


acg 


ggc 


gac 


egg 


gtg 


ctg 


gag 


aac 


ctg 


ate 


tgg 


cag 


gcg 


ate 






821 


He Gly 


Thr 


Gly 


Asp 


Arg 


Val 


Leu 


Glu 


Asn 


Leu 


He 


Trp 


Gin 


Ala 


He 






822 






510 










515 










520 






E- 


-> 


824 

825 


gtc ggc 

2356 


gcg 


tgg 


ccg 


gcg 


age 


egg 


gag 


egg 


etc 


gag 


gcc 


tac 


gcg 


ctg 






826 


Val Gly 


Ala 


Trp 


Pro 


Ala 


Ser 


Arg 


Glu 


Arg 


Leu 


Glu 


Ala 


Tyr 


Ala 


Leu 






827 




525 










530 










535 








E- 


-> 


829 

830 


aag gcc 

2404 


gcg 


cgc 


gaa 


gcc 


ggc 


gag 


teg 


acc 


gac 


tgg 


ate 


gac 


ggc 


gac 






831 


Lys Ala 


Ala 


Arg 


Glu 


Ala 


Gly 


Glu 


Ser 


Thr 


Asp 


Trp 


He 


Asp 


Gly 


Asp 






832 


540 










545 










550 










E- 


-> 


834 

835 


ccc gcg 

2452 


ttc 


gaa 


gag 


egg 


ctg 


acc 


cgc 


ctg 


gtc 


acg 


gtc 


gcc 


gtc 


gag 






836 


Pro Ala 


Phe 


Glu 


Glu 


Arg 


Leu 


Thr 


Arg 


Leu 


Val 


Thr 


Val 


Ala 


Val 


Glu 






837 


555 








560 










565 










570 


E- 


- > 


839 

840 


gag ccg 

2500 


etc 


gtg 


cac 


gag 


ctg 


etc 


gag 


egg 


etc 


gtc 


gac 


gag 


ctg 


acg 






841 


Glu Pro 


Leu 


Val 


His 


Glu 


Leu 


Leu 


Glu 


Arg 


Leu 


Val 


Asp 


Glu 


Leu 


Thr 






842 








575 










580 










585 




E- 


-> 


844 

845 


gcg gcc 

2548 


ggg 


tac 


tec 


aac 


ggc 


etc 


gcg 


gcg 


aag 


ctg 


ctg 


cag 


ctg 


etc 






846 


Ala Ala 


Gly 


Tyr 


Ser 


Asn 


Gly 


Leu 


Ala 


Ala 


Lys 


Leu 


Leu 


Gin 


Leu 


Leu 






847 






590 










595 










600 






E- 


-> 


849 

850 


gcc ccc 

2596 


gga 


acc 


ccc 


gac 


gtg 


tac 


cag 


ggc 


acg 


gaa 


cgc 


tgg 


gac 


egg 






851 


Ala Pro 


Gly 


Thr 


Pro 


Asp 


Val 


Tyr 


Gin 


Gly 


Thr 


Glu 


Arg 


Trp 


Asp 


Arg 






852 




605 










610 










615 








E- 


-> 


854 

855 


teg ctg 

2644 


gtg 


gac 


ccg 


gac 


aac 


cgt 


cgc 


ccc 


gtg 


gat 


ttc 


gcc 


gcg 


gca 






856 


Ser Leu 


Val 


Asp 


Pro 


Asp 


Asn 


Arg 


Arg 


Pro 


Val 


Asp 


Phe 


Ala 


Ala 


Ala 






857 


620 










625 










630 










E- 


-> 


859 

860 


tec gag 

2692 


ctg 


etc 


gac 


cgc 


etc 


gac 


ggc 


ggc 


tgg 


egg 


ccg 


ccc 


gtc 


gac 






861 


Ser Glu 


Leu 


Leu 


Asp 


Arg 


Leu 


Asp 


Gly 


Gly 


Trp 


Arg 


Pro 


Pro 


Val 


Asp 






862 


635 








640 










645 










650 


E- 


-> 


864 

865 


gag acc 

2740 


ggc 


gcg 


gtc 


aag 


acg 


etc 


gtc 


gtc 


tec 


cgc 


gcg 


ctg 


egg 


ctg 






866 


Glu Thr 


Gly 


Ala 


Val 


Lys 


Thr 


Leu 


Val 


Val 


Ser 


Arg 


Ala 


Leu 


Arg 


Leu 






867 








655 










660 










665 




E- 


-> 


869 

870 


cgc cgc 

2788 


gac 


egg 


ccc 


gag 


ctg 


ttc 


acc 


gcg 


tac 


cac 


ccg 


gtc 


acg 


gcg 






871 


Arg Arg 


Asp 


Arg 


Pro 


Glu 


Leu 


Phe 


Thr 


Ala 


Tyr 


His 


Pro 


Val 


Thr 


Ala 
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872 






670 










675 










680 






E--> 


874 

875 


cgc ggc 

2836 


gcg 


cag 


gcc 


gag 


cac 


ctg 


ate 


ggc 


ttc 


gac 


cgc 


ggc 


ggc 


gcg 




876 


Arg Gly 


Ala 


Gin 


Ala 


Glu 


His 


Leu 


He 


Gly 


Phe 


Asp 


Arg 


Gly 


Gly 


Ala 




877 




685 










690 










695 








E--> 


879 

880 


ate gcc 

2884 


ctg 


gcc 


acc 


cgc 


ctg 


ccg 


etc 


ggc 


etc 


gcc 


gcc 


gca 


ggc 


ggc 




881 


He Ala 


Leu 


Ala 


Thr 


Arg 


Leu 


Pro 


Leu 


Gly 


Leu 


Ala 


Ala 


Ala 


Gly 


Gly 




882 


700 










705 










710 










E--> 


884 

885 


tgg ggc 

2932 


gac 


acg 


gtc 


gtc 


gac 


gtc 


ggc 


gag 


egg 


age 


ctg 


cgc 


gac 


gag 




886 


Trp Gly 


Asp 


Thr 


Val 


Val 


Asp 


Val 


Gly 


Glu 


Arg 


Ser 


Leu 


Arg 


Asp 


Glu 




887 


715 








720 










725 










730 


E--> 


889 

890 


ctg acc 

2980 


ggc 


cgc 


gag 


gcc 


cgc 


gga 


gcg 


gcg 


cgc 


gtg 


gcc 


gag 


ttg 


ttc 




891 


Leu Thr 


Gly 


Arg 


Glu 


Ala 


Arg 


Gly 


Ala 


Ala 


Arg 


Val 


Ala 


Glu 


Leu 


Phe 




892 








735 










740 










745 




E--> 


894 


gcc gac 


tac 


ccc 


gtc 


gcc 


ctg 


ctg 


gtg 


gag 


aca 


tgaaccgacg attcccggtc 




895 


3033 
































896 


Ala Asp 


Tyr 


Pro 


Val 


Ala 


Leu 


Leu 


Val 


Glu 


Thr 














897 






750 










755 

















E--> 8 99 tgggcgcccc aggccgcgca ggtgacgctc gtcgtgggcc aaggccgcgc cgaactcccg 

900 3093 

E--> 902 ctgacccgcg aegagaaegg atggtgggct cttcagcagc cgtgggacgg cggccccgac 

903 3153 

E--> 905 ctcgtcgact acggctacct cgtcgacggc aagggcccct tcgccgaccc gcggtcgctg 

906 3213 

E--> 90 8 cggcagccgc gcggcgtgca cgagctcggc cgegaatte 

909 3252 

913 <210> SEQ ID NO: 20 

914 <211> LENGTH: 26 

915 <212> TYPE: DNA 

916 <213> ORGANISM: Artificial Sequence 

918 <220> FEATURE: 

919 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
921 <400> SEQUENCE: 20 

E--> 922 atgcccgcca gtacctaccg ccttca 

923 26 

926 <210> SEQ ID NO: 21 

927 <211> LENGTH: 25 

928 <212> TYPE: DNA 

929 <213> ORGANISM: Artificial Sequence 

931 <220> FEATURE: 

932 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
934 <400> SEQUENCE: 21 

E--> 93 5 tcatgtctcc accagcaggg cgacg 
936 25 

939 <210> SEQ ID NO: 22 

940 <211> LENGTH: 50 
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941 <212> TYPE: DNA 

942 <213> ORGANISM: Artificial Sequence 

944 <220> FEATURE: 

945 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
947 <400> SEQUENCE: 22 

E--> 94 8 aattcttttt taataaaatc aggaggaatc tagatgttta ctagtctgca 
949 50 

952 <210> SEQ ID NO: 23 

953 <211> LENGTH: 42 

954 <212> TYPE: DNA 

955 <213> ORGANISM: Artificial Sequence 

957 <220> FEATURE: 

958 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
960 <400> SEQUENCE: 23 

E--> 961 gactagtaaa catctagatt cctcctgatt ttattaaaaa ag 

962 42 

965 <210> SEQ "ID NO: 24 

966 <211> LENGTH: 33 

967 <212> TYPE: DNA 

968 <213> ORGANISM: Artificial Sequence 

970 <220> FEATURE: 

971 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
973 <400> SEQUENCE : 24 

E--> 974 aaatctagat gcccgccagt acctaccgcc ttc 

975 33 

978 <210> SEQ ID NO: 25 

979 <211> LENGTH: 33 

980 <212> TYPE: DNA 

981 <213> ORGANISM: Artificial Sequence 

983 <220> FEATURE: 

984 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
986 <400> SEQUENCE: 25 

E--> 9 87 aaaactagtt tatcatgtct ccaccagcag ggc 

988 33 

991 <210> SEQ ID NO: 26 

992 <211> LENGTH: 22 

993 <212> TYPE: DNA 

994 <213> ORGANISM: Artificial Sequence 

996 <220> FEATURE: 

997 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
999 <400> SEQUENCE: 26 

E--> 1000 atcggtgatg tcggcgatat ag 
1001 22 

1004 <210> SEQ ID NO: 27 

1005 <211> LENGTH: 29 

1006 <212> TYPE: DNA 

1007 <213> ORGANISM: Artificial Sequence 

1009 <220> FEATURE: 

1010 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
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1012 <400> SEQUENCE: 27 
E--> 1013 gtactggcgg gcatattttt tcctcctga 

1014 29 

1017 <210> SEQ ID NO: 28 

1018 <211> LENGTH: 31 

1019 <212> TYPE: DNA 

1020 <213> ORGANISM: Artificial Sequence 

1022 <220> FEATURE: 

1023 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
1025 <400> SEQUENCE: 28 

E--> 1026 aatcaggagg aaaaaatatg cccgccagta c 
1027 31 

1030 <210> SEQ ID NO: 29 

1031 <211> LENGTH: 22 

1032 <212> TYPE: DNA 

1033 <213> ORGANISM: ' Artificial Sequence 

1035 <220> FEATURE: 

1036 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
1038 <400> SEQUENCE: 29 

E--> 10 3 9 tcgacgatct gggtgagcgg at 
1040 22 

1043 <210> SEQ ID NO: 30 

1044 <211> LENGTH: 22 

1045 <212> TYPE: DNA 

1046 <213> ORGANISM: Artificial Sequence 

1048 <220> FEATURE: 

1049 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
1051 <400> SEQUENCE: 30 

E--> 1052 tcgacgagca cccggtcgat cc 
1053 22 

1056 <210> SEQ ID NO: 31 

1057 <211> LENGTH : 26 

1058 <212> TYPE: DNA 

1059 <213> ORGANISM : Artificial Sequence 

1061 <220> FEATURE: 

1062 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
1064 <400> SEQUENCE: 3: 

E--> 10 65 cartgggayg aygaygt 
1066 26 

1069 <210> SEQ ID NO: 32 

1070 <211> LENGTH: 2218 

1071 <212> TYPE: DNA 

1072 <213> ORGANISM: Artificial Sequence 

1074 <220> FEATURE: 

1075 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 

1077 <220> FEATURE: 

1078 <221> NAME/KEY: CDS 

1079 <222> LOCATION: ( 4 7 7 ) . . ( 2 2 01 ) 
1081 <220> FEATURE: 




<matiun: Description or Arririciai sequence : synthetic 
jtfnfca ycaygc '<r~^ r ' f «tj ' 
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1082 


<221> NAME/KEY: 


3 ' UTR 




























1083 


<222> LOCATION: 


(2202 


) - 


. (2218) 


















E- 


-> 


1085 


<4 00> SEQUENCE: 


33 


























E- 


-> 


1086 


ctgcagctgc 


tcgcccccgg 


aacccccgac 


gtgtaccagg 


geaeggaacg 


ctgggaccgg 






1087 


60 


































E- 


-> 


1089 


tcgctggtgg 


acccggacaa 


ccgtcgcccc 


gtggatttcg 


ccgcggcatc 


cgagctgctc 






1090 


120 


































E- 


-> 


1092 


gaccgcctcg 


aeggeggctg 


gcggccgccc 


gtcgacgaga 


ccggcgcggt 


caagacgctc 






1093 


180 


































E- 


-> 


1095 


gtcgtctccc 


gcgcgctgcg 


gctgcgccgc 


gaccggcccg 


agctgttcac 


cgcgtaccac 






1096 


240 


































E- 


-> 


1098 


ccggtcacgg 


cgcgcggcgc 


geaggecgag 


cacctgatcg 


gcttcgaccg 


cggcggcgcg 






1099 


300 


































E- 


-> 


1101 


atcgccctgg 


ccacccgcct 


gccgctcggc 


ctcgccgccg 


caggeggctg 


gggcgacacg 






1102 


360 


































E- 


-> 


1104 


gtcgtcgacg 


teggegageg 


gagcctgcgc 


gacgagctga 


ccggccgcga 


ggcccgcgga 






1105 


420 


































E - 


-> 


1107 


gcggcgcgcg 


tggccgagtt 


gttcgccgac 


taccccgtcg 


ccctgctggt 


ggagac atg 






1108 


479 






































1109 


































Met 






1110 


































1 


E- 


-> 


1112 


aac 


cga 


cga 


ttc 


ccg 


gtc 


tgg 


gcg 


ccc 


cag 


gee 


gcg 


cag 


gtg 


acg 


etc 






1113 


527 






































1114 


Asn 


Arg 


Arg 


Phe 


Pro 


Val 


Trp 


Ala 


Pro 


Gin 


Ala 


Ala 


Gin 


Val 


Thr 


Leu 






1115 








5 












10 










15 






E- 


-> 


1117 


gtc 


gtg 


ggc 


caa 


ggc 


cgc 


gec 


gaa 


etc 


ccg 


ctg 


acc 


cgc 


gac 


gag 


aac 






1118 


575 






































1119 


Val 


Val 


Gly 


Gin 


Gly 


Arg 


Ala 


Glu 


Leu 


Pro 


Leu 


Thr 


Arg 


Asp 


Glu 


Asn 






1120 






20 












25 










30 








E - 


- > 


1122 


gga 


tgg 


tgg 


get 


ctt 


cag 


cag 


ccg 


tgg 


gac 


ggc 


ggc 


ccc 


gac 


etc 


gtc 






1123 


623 






































1124 


Gly 


Trp 


Trp 


Ala 


Leu 


Gin 


Gin 


Pro 


Trp 


Asp 


Gly 


Gly 


Pro 


Asp 


Leu 


Val 






1125 




35 












40 










45 










E- 


-> 


1127 


gac 


tac 


ggc 


tac 


etc 


gtc 


gac 


ggc 


aag 


ggc 


ccc 


ttc 


gee 


gac 


ccg 


egg 






1128 


671 






































1129 


Asp 


Tyr 


Gly 


Tyr 


Leu 


Val 


Asp 


Gly 


Lys 


Gly 


Pro 


Phe 


Ala 


Asp 


Pro 


Arg 






1130 


50 










55 










60 










65 


E- 


-> 


1132 


teg 


ctg 


egg 


cag 


ccg 


cgc 


ggc 


gtg 


cac 


gag 


etc 


ggc 


cgc 


gaa 


ttc 


gac 






1133 


719 






































1134 


Ser 


Leu 


Arg 


Gin 


Pro 


Arg 


Gly 


Val 


His 


Glu 


Leu 


Gly 


Arg 


Glu 


Phe 


Asp 






1135 










70 












75 










80 




E- 


-> 


1137 


ccc 


gec 


cgc 


tac 


gcg 


tgg 


ggc 


gac 


gac 


gga 


tgg 


cgc 


ggc 


cga 


gac 


etc 






1138 


767 






































1139 


Pro 


Ala 


Arg 


Tyr 


Ala 


Trp 


Gly 


Asp 


Asp 


Gly 


Trp 


Arg 


Gly 


Arg 


Asp 


Leu 






1140 








85 












90 










95 






E - 


-> 


1142 


acc 


gga 


gee 


gtg 


ate 


tac 


gaa 


ctg 


cac 


gtc 


ggc 


acc 


ttc 


acc 


cct 


gag 






1143 


815 






































1144 


Thr 


Gly 


Ala 


Val 


He 


Tyr 


Glu 


Leu 


His 


Val 


Gly 


Thr 


Phe 


Thr 


Pro 


Glu 






1145 






100 












105 










110 
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E- 


-> 


1147 

1148 


gga acg 

863 


ctg 


gac 


age 


gee 


ate 


cgt 


cgc 


etc 


gac 


cac 


ctg 


gtg 


cgc 


etc 






1149 


Gly Thr 


Leu 


Asp 


Ser 


Ala 


He 


Arg 


Arg 


Leu 


Asp 


His 


Leu 


Val 


Arg 


Leu 






1150 


115 










120 










125 










E- 


-> 


1152 

1153 


ggc gtc 

911 


gac 


gcg 


gtc 


gag 


ctg 


ctg 


ccc 


gtc 


aac 


gcg 


ttc 


aac 


ggc 


acc 






1154 


Gly Val 


Asp 


Ala 


Val 


Glu 


Leu 


Leu 


Pro 


Val 


Asn 


Ala 


Phe 


Asn 


Gly 


Thr 






1155 


130 








135 










140 










145 


E- 


-> 


1157 

1158 


cac ggc 

959 


tgg 


ggc 


tac 


gac 


ggg 


gtg 


etc 


tgg 


tac 


gcg 


gtg 


cac 


gag 


ccc 






1159 


His Gly 


Trp 


Gly 


Tyr 


Asp 


Gly 


Val 


Leu 


Trp 


Tyr 


Ala 


Val 


His 


Glu 


Pro 






1160 








150 










155 










160 




E- 


-> 


1162 

1163 


tac ggc 

1007 


ggc 


ccg 


gag 


gcg 


tac 


cag 


cgc 


ttc 


gtc 


gac 


gec 


tgc 


cac 


gee 






1164 


Tyr Gly 


Gly 


Pro 


Glu 


Ala 


Tyr 


Gin 


Arg 


Phe 


Val 


Asp 


Ala 


Cys 


His 


Ala 






1165 






165 










170 










175 






E- 


-> 


1167 

1168 


cgc ggc 

1055 


etc 


gee 


gtc 


gtg 


cag 


gac 


gtc 


gtc 


tac 


aac 


cac 


ctg 


ggc 


ccg 






1169 


Arg Gly 


Leu 


Ala 


Val 


Val 


Gin 


Asp 


Val 


Val 


Tyr 


Asn 


His 


Leu 


Gly 


Pro 






1170 




180 










185 










190 








E- 


-> 


1172 

1173 


age ggc 

1103 


aac 


cac 


ctg 


ccc 


gac 


ttc 


ggc 


ccc 


tac 


etc 


ggg 


teg 


ggc 


gee 






1174 


Ser Gly 


Asn 


His 


Leu 


Pro 


Asp 


Phe 


Gly 


Pro 


Tyr 


Leu 


Gly 


Ser 


Gly 


Ala 






1175 


195 










200 










205 










E- 


-> 


1177 

1178 


gec aac 

1151 


ace 


tgg 


ggc 


gac 


gcg 


ctg 


aac 


etc 


gac 


ggg 


ccg 


etc 


tec 


gac 






1179 


Ala Asn 


Thr 


Trp 


Gly 


Asp 


Ala 


Leu 


Asn 


Leu 


Asp 


Gly 


Pro 


Leu 


Ser 


Asp 






1180 


210 








215 










220 










225 


E- 


-> 


1182 

1183 


gag gtg 

1199 


egg 


egg 


tac 


ate 


ate 


gac 


aac 


gcg 


gtg 


tac 


tgg 


ctg 


cgc 


gac 






1184 


Glu Val 


Arg 


Arg 


Tyr 


He 


He 


Asp 


Asn 


Ala 


Val 


Tyr 


Trp 


Leu 


Arg 


Asp 






1185 








230 










235 










240 




E- 


-> 


1187 

1188 


atg cac 

1247 


gec 


gac 


ggg 


ctg 


egg 


etc 


gac 


gee 


gtg 


cac 


gcg 


ctg 


cgc 


gac 






1189 


Met His 


Ala 


Asp 


Gly 


Leu 


Arg 


Leu 


Asp 


Ala 


val 


His 


Ala 


Leu 


Arg 


Asp 






1190 






245 










250 










255 






E- 


-> 


1192 

1193 


gec cgc 

1295 


gcg 


ctg 


cac 


ctg 


etc 


gaa 


gag 


etc 


gee 


gee 


cgc 


gtc 


gac 


gag 






1194 


Ala Arg 


Ala 


Leu 


His 


Leu 


Leu 


Glu 


Glu 


Leu 


Ala 


Ala 


Arg 


Val 


Asp 


Glu 






1195 




260 










265 










270 








E- 


-> 


1197 

1198 


ctg gcg 

1343 


ggc 


gag 


etc 


ggc 


egg 


ccg 


ctg 


acg 


etc 


ate 


gee 


gag 


age 


gac 






1199 


Leu Ala 


Gly 


Glu 


Leu 


Gly 


Arg 


Pro 


Leu 


Thr 


Leu 


He 


Ala 


Glu 


Ser 


Asp 






1200 


275 










280 










285 










E- 


-> 


1202 

1203 


ctg aac 

1391 


gac 


ccg 


aag 


ctg 


ate 


cgc 


tec 


cgc 


gcg 


gcg 


cac 


ggc 


tac 


ggc 






1204 


Leu Asn 


Asp 


Pro 


Lys 


Leu 


He 


Arg 


Ser 


Arg 


Ala 


Ala 


His 


Gly 


Tyr 


Gly 






1205 


290 








295 










300 










305 


E- 


-> 


1207 


etc gac 


gee 


cag 


tgg 


gac 


gac 


gac 


gtg 


cac 


cac 


gcg 


gtg 


cac 


gee 


aac 
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1208 1439 

1209 Leu Asp Ala Gin Trp Asp Asp Asp Val His His Ala Val His Ala Asn 

1210 310 315 320 
E--> 1212 gtg acc ggc gag acc gtc ggc tac tac gcc gac ttc ggc ggg etc ggc 

1213 1487 

1214 Val Thr Gly Glu Thr Val Gly Tyr Tyr Ala Asp Phe Gly Gly Leu Gly 

1215 325 330 335 

E--> 1217 gcc etc gtc aag gtg ttc cag cgc ggc tgg ttc cac gac ggc acc tgg 

1218 1535 

1219 Ala Leu Val Lys Val Phe Gin Arg Gly Trp Phe His Asp Gly Thr Trp 

1220 340 345 350 

E--> 1222 teg age ttc cgc gag egg cac cac ggc egg ccg etc gac ccc gac ate 

1223 1583 

1224 Ser Ser Phe Arg Glu Arg His His Gly Arg Pro Leu Asp Pro Asp lie 

1225 355 360 365 

E--> 122 7 ccg ttc cgc egg etc gtc gcc ttc gcg cag gat cac gac cag gtc ggc 

1228 1631 

1229 Pro Phe Arg Arg Leu Val Ala Phe Ala Gin Asp His Asp Gin Val Gly 

1230 370 375 380 385 
E--> 1232 aac cga gcg gtc ggc gac cgc atg teg gcg cag gtc ggc gag ggt teg 

1233 1679 

1234 Asn Arg Ala Val Gly Asp Arg Met Ser Ala Gin Val Gly Glu Gly Ser 

1235 390 395 400 
E--> 12 3 7 etc gcc gcc gcg gcg gcg etc gtg ctg etc ggc ccg ttc acc ccg atg 

1238 1727 

1239 Leu Ala Ala Ala Ala Ala Leu Val Leu Leu Gly Pro Phe Thr Pro Met 

1240 405 410 415 

E--> 1242 ctg ttc atg ggc gag gag tgg ggc gcg cgc acc ccg tgg cag ttc ttc 

1243 1775 

1244 Leu Phe Met Gly Glu Glu Trp Gly Ala Arg Thr Pro Trp Gin Phe Phe 

1245 420 425 430 

E--> 124 7 acc tec cac ccc gag ccc gag ctg ggg gag gcg acg gcg cgc ggg cgc 

1248 1823 

1249 Thr Ser His Pro Glu Pro Glu Leu Gly Glu Ala Thr Ala Arg Gly Arg 

1250 435 440 445 

E--> 12 52 ate gcc gag ttc gcc cgc atg ggc tgg gac ccg gca gtc gtg ccc gac 

1253 1871 

1254 lie Ala Glu Phe Ala Arg Met Gly Trp Asp Pro Ala Val Val Pro Asp 

1255 450 455 460 465 
E--> 1257 ccg cag gac ccg gcc acc ttc gcc cgc teg cac ctg gac tgg tec gag 

1258 1919 

1259 Pro Gin Asp Pro Ala Thr Phe Ala Arg Ser His Leu Asp Trp Ser Glu 

1260 470 475 480 
E--> 1262 ccc gag egg gaa ccg cac gcg ggc ctg etc gcc ttc tac acc gac ctg 

1263 1967 

1264 Pro Glu Arg Glu Pro His Ala Gly Leu Leu Ala Phe Tyr Thr Asp Leu 

1265 485 490 495 

E--> 1267 ate gcg ctg egg cgc gag ctg ccg gtc gat gcg ccg gcg cgc gag gtg 

1268 2015 
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1269 


He Ala 


Leu 


Arg 


Arg 


Glu 


Leu 


Pro 


Val 


Asp 


Ala 


Pro 


Ala 


Arg 


Glu 


Val 




1270 




500 










505 










510 








E--> 


1272 

1273 


gat gcc 

2063 


gac 


gag 


gcg 


cgc 


ggc 


gtc 


ttc 


gcg 


ttc 


age 


cgc 


ggc 


ccg 


ctg 




1274 


Asp Ala 


Asp 


Glu 


Ala 


Arg 


Gly 


Val 


Phe 


Ala 


Phe 


Ser 


Arg 


Gly 


Pro 


Leu 




1275 


515 










520 










525 










E--> 


1277 

1278 


egg gtc 

2111 


acg 


gtc 


gcg 


ctg 


cgc 


ccc 


gga 


ccg 


gtc 


ggg 


gtg 


ccc 


gag 


cac 




1279 


Arg Val 


Thr 


Val 


Ala 


Leu 


Arg 


Pro 


Gly 


Pro 


Val 


Gly 


Val 


Pro 


Glu 


His 




1280 


530 








535 










540 










545 


E--> 


1282 

1283 


ggg ggc 

2159 


etc 


gtg 


etc 


gcc 


tac 


ggc 


gag 


gtg 


cgc 


gcc 


ggc 


gcc 


gcc 


gga 




1284 


Gly Gly 


Leu 


Val 


Leu 


Ala 


Tyr 


Gly 


Glu 


Val 


Arg 


Ala 


Gly 


Ala 


Ala 


Gly 




1285 








550 










555 










560 




E--> 


1287 

1288 


ctg cac 

2201 


etc 


gac 


ggg 


ccg 


gga 


gcc 


gcg 


ate 


gtg 


cgc 


etc 


gag 








1289 


Leu His 


Leu 


Asp 


Gly 


Pro 


Gly 


Ala 


Ala 


He 


Val 


Arg 


Leu 


Glu 








1290 






565 










570 










575 







E--> 1292 tgacgegget gggtacc 

1293 2218 

1296 <210> SEQ ID NO: 33 

1297 <211> LENGTH: 25 

1298 <212> TYPE: DNA 

1299 <213> ORGANISM: Artificial Sequence 

1301 <220> FEATURE: 

1302 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
1304 <400> SEQUENCE: 33 

E--> 13 05 atgaaccgac gattcceggt ctggg 
1306 25 

1308 <210> SEQ ID NO: 34 

1309 <211> LENGTH: 25 

1310 <212> TYPE: DNA 

1311 <213> ORGANISM: Artificial Sequence 

1313 <220> FEATURE: 

1314 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
1316 <400> SEQUENCE: 34 

E--> 1317 tcactcgagg cgcacgatcg egget 
1318 25 
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L 


247 


M:254 E 




NO 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


60 


SEQ 


7 


M 


254 


Repeated 


in 


SeqNo=7 


















L 


331 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


28 


SEQ 


8 


L 


535 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


60 


SEQ 


17 


M 


2 54 


Repeated 


in 


SeqNo-17 


















L 


601 


M:254 E 




No 


of Bases 


conflict , 


LENGTH 


Input 


0 


Counted 


23 


SEQ 


18 


L 


623 


M : 254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


60 


SEQ 


19 


M 


254 


Repeated 


in 


SeqNo-19 


















L 


922 


M: 254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


26 


SEQ 


20 


L 


935 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


25 


SEQ 


21 


L 


948 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


50 


SEQ 


22 


L 


961 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


42 


SEQ 


23 


L 


974 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


33 


SEQ 


24 


L 


987 


M:254 E 




No 


of Bases 


conflict, 


LENGTH 


Input 


0 


Counted 


33 


SEQ- 


25 



L:1000 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:22 SEQ:26 

L:1013 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:29 SEQ:27 

L:1026 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 31 SEQ:28 

L:1039 M:254 E: No. of Bases conflict, LENGTH : Input :0 Counted:22 SEQ:29 

L:1052 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:22 SEQ:30 

L:1065 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:31 

L:1065 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:31 

L:1065 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:31 

L:1065 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:26 SEQ:31 

L:1085 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID N0S:32 

differs : 33 

L:1086 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:60 SEQ:32 
M:254 Repeated in SeqNo-32 

L:1305 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:25 SEQ:33 

L:1317 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:25 SEQ:34 

L:1330 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:36 SEQ:35 

L:1343 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:36 SEQ:36 

L:1356 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:28 SEQ:37 

L:1369 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:28 SEQ:38 

L:1382 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:22 SEQ:39 
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